
MDS History 


Originally contained in a single Netscape LDAP server 
that ended up at NCSA 

Then contained in 2 Netscape LDAP servers at NCSA 

• l server for NASA and the alliance 

• 1 server for everything else 
Now: 

♦ NASA is currently using 4 Netscape LDAP servers at 3 
sites 

♦ 6lobus vl 1.3 totally changed the MDS 

• Mo central servers 

• Marry OpcnLDAP servers 


LDAP 

• Data organized as entries in a tree 

• Entries named using their position in the tree 

• Entries can be added, deleted, modified 

• Searches can be performed over attributes of 
entries 

• Many providers of LDAP servers 


rroc 2000 


HTtHT ^O00 













Uses of MD5 


Improving LDAP Performance 

Distribution of data 

» Sub- trees of data can be placed on different servers 

• Find the server by using referrals 

♦ Supports more updates but stows broad searches 
Replication of data 

* More servers for searches 

♦ More servers to update data 
Indexes 

* Lookup table based on an attribute value 
» Servers can construct and maintain 

♦ Improve search performance 


Host registration and periodic update 

♦ Host, network interfaces, networks, software 
Periodic GRAM reporter updates 

♦ Load, queues, users that have access, running applications 

♦ By default: Every 30 seconds and no user or application 
information 

Finding GRAM contacts for a host 

Determine status of applications 

Currently, complex queries not performed often 


Workload Characterization 

• 20 Hours of trace data 

• Recorded when there was a single server 

• 86,695 connections 

• 633,672 operations 

• 8.8 operations per second 


Operations in Workload 


Operation 

Number of 
Operations 

Percent of 
Total 

Operations 

Percent of 
Operations 
Resulting in Error 

Add 

1,044 

0.73 1 

90.33 

Delete 

81 

0.06 

741 

Modify 

13.4611 

93.84 

2 83 

Search 

77kT 

5.37 

76.10 

Total 

143,446 

100.00 

7.41 


Modifies are from GRAM reporter 
Adds are tried when modifies fail 
♦ Typically adds also fail 


Open Connections 



90 open connections on average 

Periodic spike: Netscape LDAP maintenance? 


Connection Properties 



88% less than 120 sec, 97% less than 240 sec 
97% Have 2 or less operations 
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Classification of AAC)S Connections 

• Classif ied 99.96 % of all connections 

• 67.45% are connections from GRAM reporters 

• 22.34% are connect, bind failure, unbind, close 

• Use this knowledge in our experiments 


Experimental Analysis I 

• Perform trace-driven simulation of accesses to the 
MDS 

• Trace data does not record changes to entries in the 
MDS 

* Use our knowledge of Globus to construct realistic 
entries off -line 

♦ For example, place a random number as the value tn the 
free nodes attribute 

e Simulator: 

• Written in Java 

• Runs on l* workstations 

♦ Simulate accesses in real time, faster, or slower 


Experimental Analysis II 

• Use 1 or 2 UltraSparc workstations to run 
MDS servers 

• Start the server(s) 

• Lood initial MDS contents 

• Run the simulation 

• Record MDS access times in the simulator 
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Single Server Comparison 


Load 

LDAP 

Server 

Add 

(ms) 

Da let* 

(m*) 

Modify 

(ms) 

Search 

(ms) 

Weighted 
Average (ms) 

m 


US® 

mm 

mam 


10 

Vendor 1 

121 

106 

159 

2463 

283 


Vendor 2 

933 

1230l 

903 

1358 

928 
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j Vendor 1 

270 

351 

581 

4031 

] 764 

r. 

Vendor 2 

31722 

36920 

26558 

[ 22945 

] 26407 


• OpenLDAP crashed unless we limited open connections 
to 50 and new connections per second to 20 

• Vendor 2 optimized for search performance 

• CPU load for Vendor 2 was 5.5, Vendor 1 was OJ 


Indexing 

• Hashtable for a specific attribute keyed on the value 

• Advantage: Improve search performance 

• Disadvantage: Must be maintained 

• We added an index to the Vendor l server 

♦ GlobalJobID attribute 

* Improve job status searches which are made over the 
entire tree 

• Improved search performance by 79% 

• Decreased update performance by 38% 

• Slower updates may not be worth faster searches 


2000 


Data Distribution 

* Purpose is to 

• Support very large databases 

. Improve add, modify, and delete performance 
» Can improve search performance 

• For our experiment, we: 

» Distribute data across 2 servers 

• NASA A Alliance on one, everything else on the other 

• Used the LDAP server from Vendor 1 
» Performed a simulation in \ real time 









Future Work 

• Evaluate more conf igurations of LDAP servers 

• Evaluate Globus vl.1.3 MDS 

• Develop a simulation environment with 
synthetic users, machines, daemons 

♦ More flexible experiments 

♦ Planning future needs 
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